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Abstract. One form of the inclusion-exclusion principle asserts that if A and B 
are functions of finite sets then the formulas A(S) = J2tcs B(T) and B(S) = 
■ J2 T cs(- 1 ) lSHTlA ( T ) are equivalent. If we replace B(S)~by (-1) |5| S(S') then 

these formulas take on the symmetric form 

Q MS)= ^(-1) |T| B(T) 

•; B(S) = 2J(-i) m A(r). 

^3 . TCS 

which we call symmetric inclusion- exclusion. We study instances of symmetric 
inclusion-exclusion in which the functions A and B have combinatorial or probabilis- 
tic interpretations. In particular, we study cases related to the Polya-Eggenberger 
£NJ ' urn model in which A(S) and B(S) depend only on the cardinality of S. 
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Let A and B be two functions defined on a set D of finite sets. We assume that if 



S G D and TCS* then T G D. Then the inclusion-exclusion principle asserts that 
the following are equivalent: 



O 

0: a(s) = J2b(t). 
> 

H \ tcs 

The special case in which A(S) and B(S) depend only on \S\ is especially important: 
for two sequences (A n ) and (B n ), the following are equivalent: 



fc=0 v 7 

*. = E<-ir*(?V 



fc=0 



(2) 
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It is very easy to find pairs of functions with combinatorial interpretations that 
satisfy we may choose B to be an arbitrary function with a combinatorial inter- 
pretation. Then since expresses A in terms of B with positive coefficients, A will 
also have a combinatorial interpretation. Similarly, it is easy to find instances of (0) 
with combinatorial interpretations. 

In terms of exponential generating functions, (0) gives the familiar formulas 

F A (x)=e x F B (x) 
F B (x)=e- x F A (x), 

where 

oo „ oo „ 

F A (x) = V A n — and F B (x) = V B n —. 



n! — ' n! 

n=0 n=0 



Now in (JTJ), let us replace -B(T) with (— 1)' T I/3(T) and with a(S), and multiply 
the second equation by (— l)' 5 '. We obtain 

= £(-i) |r| /*cr) 

(3) 



TCS 



= £(-i) |T| «co. 

TCS 

The corresponding substitution in (J2J) gives 

n 

a. = E(-i 



(4) 

n \ 

k=0 

In terms of exponential generating functions, (j3J) is the unfamiliar-looking 

F a (x) = e x F p {-x) 

F p {x) = e x F a {-x) { ] 

We will call functions or sequences A and B that satisfy (0) and (0) asymmetric 
inclusion- exclusion pairs and we will call instances of Q and symmetric inclusion- 
exclusion pairs. The goal of this paper is to study symmetric inclusion-exclusion pairs 
with combinatorial or probabilistic interpretations. We shall see that for the case in 
which the domain of the functions a and j3 in Q is the set of subsets of a finite set, 
or if the sequences (a n ) and (f3 n ) in (j3J) are finite (i.e., a n and (3 n are defined only 
for n < N, for some N), there is a very simple construction of symmetric inclusion- 
exclusion pairs analogous to starting with an arbitrary function B in (0) or with 
an arbitrary sequence (B n ) in (J2J). We also give a very simple probabilistic setting 
for (JSJ) in which the cardinalities of the sets in the domain of a and (3 are unbounded. 
Finding a probabilistic interpretation of (0J) with infinite sequences does not seem so 
straightforward. We discuss some examples, related to the Polya-Eggenberger urn 
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model, of infinite sequences satisfying (jlj), but we have not found a general theory of 
such sequences. 

2. Difference Tables 

There is a close connection between asymmetric inclusion-exclusion pairs and sym- 
metric inclusion-exclusion pairs. We illustrate with the special case of the derange- 
ment numbers. We take A n = n\, so B n = X]fc=o( — i ) n ~ fc (fc)^' * s ^ ne n ^ n derangement 
number. We can compute B n from A n by using a difference table, in which the num- 
bers A n = n\ appear in the zeroth row and each number in a row below the zeroth 
row is the number above it to the right minus the number above it to the left. 

1 1 2 6 24 120 

1 4 18 96 

1 3 14 78 
2 11 64 

9 53 
44 

Then the numbers B n = 1, 0, 1, 2, 9, 44, . . . appear in the zeroth diagonal. Now let us 
rotate this triangular array 60° counterclockwise, obtaining 

120 
24 96 
6 18 78 

2 4 14 64 

1 1 3 11 53 

1 1 2 9 44 

In this array, every number above the bottom row is the sum of the two numbers 
below it. Let us define the numbers = (5 — i)\ to be the numbers in the left 
diagonal and let us define the numbers /3, = 120, 96, 78, ... to be the numbers in the 
right diagonal. It is not hard to check that the sequences (a n ) and (/3 n ) satisfy (HJ) for 
n < 5. We also note that these sequences have combinatorial interpretations: «j is the 
number of permutations of {1, 2, 3, 4, 5}, in which 1,2, . . . , i (or any i numbers from 
{1, 2, 3, 4, 5}) are all fixed points and $ is the number of permutations of {1, 2, 3, 4, 5} 
in which 1, 2, . . . , i (or any i numbers from {1, 2, 3, 4, 5}) are all nonfixed points. 
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The general situation is described by the following theorem, which can be proved 
directly by a straightforward computation that we omit, or as a consequence of The- 
orem |21 below. 

Theorem 1. Let B , Bi, . . . , Bn be arbitrary and let a n and (3 n for < n < N be 
defined by 




Then the sequences (a n ) and (f3 n ) satisfy for < n < N . 

We have a similar result that allows us to construct instances of (0) with combi- 
natorial interpretations. The proof is also a straightforward verification, which we 
omit. 

Theorem 2. Let A be a finite set, and let B be a function defined on the subsets 
of A. Let the functions a and (3 be defined on subsets of A by 

a{S) = 5>(T) 

TCS 

/3(s) = j2_ B (n 

TCS 

where the complements are with respect to A. Then the functions a and (5 satisfy (jHJ). 

We note that Theorem ^ is equivalent to the case of Theorem El in which B(T) 
depends only on the cardinality of T. 

Theorems [D and El allow us to construct instances of (jHl) and (jlj) with combinatorial 
interpretations. However, in any example constructed in this way, the cardinalities 
of the sets involved are bounded and cannot in general be extended to unbounded 
cardinalities. 

For example, if we want to extend the triangular array given before Theorem ^ 
by adding a row at the bottom consisting of nonnegative real numbers, the in the 
bottom row forces both numbers below it be 0, and the next row is then forced to 
have three consecutive 0's. We then have the array 
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120 
24 96 
6 18 78 
2 4 14 64 
1 1 3 11 53 
1 1 2 9 44 
1 1 1 8 36 
1 1 8 28 

to which it is impossible to add another nonnegative row at the bottom. 

In the next section we discuss a setting for (jSJ) in which the sets may have unbounded 
cardinalities and the functions a and /3 are probabilities rather than integers. 



3. General symmetric inclusion-exclusion 

We consider a probability space containing sets Ei, where % ranges over an index 
set A, which in our examples will be the set of positive integers. Thus we have a 
probability function P, defined on all sets generated from the Ei by complements and 
finite unions and intersections, with the property that if S and T are disjoint then 
P(S U T) = P(S) + P(T). (The usual definition of a probability function requires 
nonnegativity and countable additivity, and that the probability of the whole space 
is 1, but these conditions are not necessary for our results.) For every finite subset 
S C A we set 

a(5)=p(f|J^) (8) 

(3(S)=P({}E 1 ). (9) 

ies 

Theorem 3. The functions a and (3 defined by (JSJ) form a symmetric inclusion- 
exclusion pair; i.e., 

TCS 

w = E(- 1 ) |T| «( T )- 

TCS 
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Proof. By symmetry, it is enough to prove the second formula. Without loss of gen- 
erality, we may assume that S = {1, 2, . . . , m}. A well-known form of the inclusion- 
principle (see, e.g., |H1 P- 6]) asserts that 

P{E 1 U £ 2 U • • • U £ m ) 

= p ( E i) - p ( E * n £,■) + •-• + (-i) m - 1 P{E 1 n • • ■ n E m ). 

l<i<m l<i<j<m 

This is equivalent to the desired formula. □ 
We note that if A is finite, Theorem El may be derived from Theorem 121 by taking 

b(t)=p(( n *)n(n^)Y 

V ieA-T i€T 7 

In the general case, we can deduce Theorem El from Theorem |21 by restricting to a 
finite subset of A that contains S. 

The most interesting examples of Theorem El are those in which a(S) and (3(S) 
depend only on the cardinalities of S. Before discussing these examples, we give an 
example which is not of this type. We consider the set of infinite sequences (r 1; r 2 , . . . ) 
of real numbers between and 1, and we assign probabilities in the obvious way to 
subsets defined by a finite number of inequalities among the ?v Let us take Ei to 
be the set of sequences satisfying < r i+1 . Then for any set S of positive integers, 
a(S) is the probability that < r i+ i for all i G S. To give a formula for a(S), we 
express S as a disjoint union of blocks of consecutive integers with gaps in between: 
S = V\ U V 2 U • • • U V m , where Vi = {ui, Ui + 1, . . . , Ui + s» - 1} and u i+ i > Ui + Sj. 
Then 



v ; ( Sl + l)!( S2 + l)!...( Sm + l)!' 
and by symmetry, f3(S) — a(S). 

4. A SIMPLE EXAMPLE 

We now give a very simple example of Theorem El in which a(S) depends only on 
the cardinality of S, so that we have an instance of We flip a coin infinitely many 
times. Each flip comes up heads with probability p and tails with probability 1 — p. 
For «GP = {1, 2, 3, ... } we let Ei be the event that the ith flip is a head. Then for 
any finite subset 5CP 

a(S) = P^f\E i 
P(S) = p(f)E l 

Then Theorem E] gives 

n 

(i-p) n = E( 

k=0 
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and (0) is the trivial formula e^ 1 p ^ x = e x e px . The example of the previous section, 
when all of the Si are equal to 1, is equivalent to the case p = 1/2 of this example. 

5. The Polya-Eggenberger urn model 

In the previous example, the events Ei were independent, which makes the situation 
easy to analyze, but not very interesting. Here we give a more interesting case of 
symmetric inclusion-exclusion sequences in which the Ei are not independent. It is 
based on a classical probabilistic model called the Pdlya or Polya-Eggenberger urn 
model P p. 120-121]. 

We have an urn that initially contains r red balls and b black balls. At each step 
we choose a ball at random from the urn. (So the probability of picking a red ball at 
the first step is rjir + b) and the probability of choosing a black ball is b/(r + b).) 
We then replace the ball we have picked and add another ball of the same color to 
the same urn. We repeat this procedure forever. 

The model is easily adapted to the case in which r and b are positive real numbers 
that are not necessarily integers. Often in descriptions of the model in the literature, c 
balls are added rather than one, but adding only one ball gives simpler hypergeometric 
formulas, and the case of adding c balls at each step with an initial inventory of r red 
and b black balls is equivalent to adding one ball at each step with an initial r/c red 
balls and b/c black balls. 

An easy induction argument shows that the probability that the first m + n balls 
are any particular sequence of m red and n black balls is 

{r) m {b)n 
(r + b) m+n 

where {u) n = u{u + 1) • • • (u + n — 1). 

By ball i, we shall mean the ball chosen at the ith step. Then a consequence of 
the previous formula is that for any disjoint sets of integers R — i 2 , ■ ■ ■ , i m } and 
B = {jx, j 2 , . . . , j n }, the probability that the balls with numbers in R are red and the 
balls with numbers in B are black depends only on m and n, and is therefore 

(r)rn(b) n 

(r + b) m+n 

We note that this urn model is equivalent to a lattice path model in the plane, 
where a particle starts at the origin, and from the point it moves right with 

probability (r + i)/(r + b + i + j) and up with probability {b + j)/(r + b + i + j). 
Then the probability that any particular path ending at (m, n) has been taken is 
[f )ra (b) n /(r + b) 

m+n ■ 

Now let Ei be the event that ball i is red, so Ei is the event that ball i is black. 
Then with the notation of Section for any set S of positive integers, a(S) is the 
probability that all the balls with numbers in S are red, and 
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Similarly, (3(S) is the probability that ball % is black for all % in S, and 

P(S) = r %-. 

(r + b)\ S \ 

So if we take S to be a set of size n, Theorem El gives 



(6)n 



(6 



(r) fe 



(6 + r); 



and the same identity with 6 and r switched. 

This identity is a form of the Chu-Vandermonde summation theorem. The corre- 
sponding exponential generating function identity, 

(6)n x n .A 1VI (r) fc 



£ 

n=0 



(6 + r) n n! 



£(-i; 



is the well-known !.F\ transformation 

6 



1*1 



b + r 



n=0 



(6 + r)k n\ 



r 

b + r 



where the hypergeometric series is defined by 

'iti, ...,u t 



Vl, 



OO / \ 



-X 



n=0 



{Vq)n Til 



More generally, we can take m urns, where the zth urn starts with red balls and 
6j black balls. At each step, we choose a ball at random from each urn, replace it, 
and add another ball of the same color. What is the probability that if we do this n 
times, at each step we choose at least one black ball? We let Ei be the event that at 
the ith step all the balls chosen are red. Then for any finite set S C P, a(S) is the 
probability that for the steps in S, all of the balls chosen at are red. If \S\ — n, this 
probability is 

(njn (r 2 ) n (On 

(n + bi) n (r 2 + b 2 ) n (r m + b 
So the probability that in n steps at least one black ball is chosen at each step is 

(ri)jt (r 2 ) fc (r n ) fc 



w =£(-!)'(*) 



121 



for IS 1 ! 



n, which may be written as the hypergeometric series 
-n, ri, r 2 , • • ■ , r m 



n +h,r 2 + b 2 , 



r, 



(13) 



As a corollary, we get that this hypergeometric series is positive, as long as the fj 
and bi are positive real numbers. It is not difficult to prove this result analytically, 
using the integral representation 

(r) k T(r + b) ' 



(r + b) k r(r)r(6) 



x k+r-i (1 



X 



,6-1 
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However, the combinatorial approach gives a stronger result: 

Theorem 4. For each nonnegative integer n, the hypergeometric series (jl3j) can 
be expressed as a quotient of polynomials in the variables rx, . . . , r m , b\, . . . b m with 
positive coefficients. 

Proof. We may assume that the variables are positive real numbers, so that the proba- 
bilistic interpretation given above applies. Then we can compute the probability (|12j) 
in another way by summing the probabilities of all possible outcomes in which at 
least one black ball is chosen at each step, and (fTUj) implies that each such probability 
is a rational function of the desired form, and so, therefore, is their sum. □ 

In the case m = 2, the sum described in the proof of Theorem 0] is simple enough 
to write out explicitly. We can describe the colors of the two balls chosen at each 
step as (R,B), (B,R), or (B,B), where R denotes red and B denotes black. The 
number of sequences of allowable choices in which (R, B) occurs i times, (B, R) occurs 
j times, and (B,B) occurs k times is the trinomial coefficient (i + j + k)/(i\j\k\). 
The probability of such a sequence is, by (fTUj). 



(ri)i(6i)j +fc (r 2 )j{b 2 ) 



2 )i+k 



i+j+k (r-2 + b 2 ) i+j+k 

and thus we have the identity 



3-^2 



-n, rx, r 2 
rx + bx,r 2 + b 2 



x \ = nl (ri)i(bi) j+k (r 2 )j(b 2 ) i+k 

A slightly different approach allows us to express the double sum on the right side 
of (|14|) as the single sum 



n\ (ri)j(bi) ra _i(b 2 )i 
v (n + h) n (r 2 + b 2 )i 

For each possible sequence of i red and n — i black balls chosen from the first urn, the 
probability of this sequence is {rx)i(bx)n-i/ { r i + b\) n by (JTUJ). For the choices from 
the second urn to be compatible, whenever a red ball is chosen from the first urn a 
black ball must be chosen from the second urn, but when a black ball is chosen from 
the first urn, the color of the ball from the second urn is unrestricted. Thus given 
a sequence of i red and n — i black balls chosen from the first urn, the probability 
that the choices from the second urn are compatible with those from the first urn is 
(b 2 )i/(r 2 + b 2 )i. Summing over all possible choices for the first urn gives (|15|). 

Expressing (|15j) as a hypergeometric series, we may write the identity we have 
proved as the 3_F 2 transformation 



3-^2 



-n, rx, r 2 
rx + bx, r 2 + b 2 



{b\) n p f-n, rx, b 2 
(n + h)^ 2 \ \-b x -n,r 2 + b 2 



1 



which is equivalent to formula (3.1.1) of [3]. 
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A special case of the sum in (fT2|) deserves mention. Suppose that r\ = r 2 = ■ ■ ■ = 
r m = r and b\ = b 2 = • • ■ = b m = 1. Then since 

(r)k r 
(r + l) fe r + fc' 

the sum is 




One of the interesting properties of U m {r) is the generating function 

( r »")£ ff ~ (r) ®" = ns<w.-) 

which is easily verified by partial fraction expansion. 

These numbers (up to a constant factor) have appeared in several places in the 
literature. The case r = 1 of these numbers has been studied by Smiley [7j, who men- 
tions a combinatorial interpretation equivalent to ours and gives further references. 
The case r = 2 was considered by Foata, Han, and Lass 0] in connection with a 
coup on- collecting problem, and the general case was considered by Laforest [H] and 
by Flajolet et al. [3 El m the study of quadtrees. A combinatorial connection be- 
tween the coupon-collecting and quadtree interpretations and that discussed here is 
not apparent. 
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